Boosted regression trees with errors in variables.
نویسندگان
چکیده
In this article, we consider nonparametric regression when covariates are measured with error. Estimation is performed using boosted regression trees, with the sum of the trees forming the estimate of the conditional expectation of the response. Both binary and continuous response regression are investigated. An approach to fitting regression trees when covariates are measured with error is described, and the boosting algorithms consist of its repeated application. The main feature of the approach is that it handles situations where multiple covariates are measured with error. Some simulation results are given as well as its application to data from the Framingham Heart Study.
منابع مشابه
Boosted trees for ecological modeling and prediction.
Accurate prediction and explanation are fundamental objectives of statistical analysis, yet they seldom coincide. Boosted trees are a statistical learning method that attains both of these objectives for regression and classification analyses. They can deal with many types of response variables (numeric, categorical, and censored), loss functions (Gaussian, binomial, Poisson, and robust), and p...
متن کاملThe identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
BACKGROUND There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In this paper, we present an approach to search for interaction effects among several variables using b...
متن کاملComparing Different Modeling Techniques for Predicting Presence-absence of Some Dominant Plant Species in Mountain Rangelands, Mazandaran Province
In applied studies, the investigation of the relationship between a plant species and environmental variables is essential to manage ecological problems and rangeland ecosystems. This research was conducted in summer 2016. The aim of this study was to compare the predictive power of a number of Species Distribution Models (SDMs) and to evaluate the importance of a range of environmental variabl...
متن کاملPredicting the Internet’s Evolution with Decision Trees and Lasso Logistic Regression Models
The Internet self-evolves rapidly and its dynamic structure poses many interesting questions for researchers in network analysis. In this paper I show how we can simplify the entire Internet as a mathematical graph and then extract its structural characteristics; these characteristics in turn help us build statistical models that can predict how the Internet will evolve. The data describing the...
متن کاملWeight Trimming and Propensity Score Weighting
Propensity score weighting is sensitive to model misspecification and outlying weights that can unduly influence results. The authors investigated whether trimming large weights downward can improve the performance of propensity score weighting and whether the benefits of trimming differ by propensity score estimation method. In a simulation study, the authors examined the performance of weight...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Biometrics
دوره 63 2 شماره
صفحات -
تاریخ انتشار 2007